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Abstract 

We propose Dijkstra's algorithm with bounded 
list size after QR decomposition for decreasing 
the computational complexity of near maximum- 
likelihood (ML) detection of signals over multiple- 
input-multiple-output (MIMO) channels. After that, 
we compare the performances of proposed algorithm, 
QR decomposition M-algorithm (QRD-MLD), and 
its improvement. When the list size is set to achieve 
the almost same symbol error rate (SER) as the 
QRD-MLD, the proposed algorithm has smaller av- 
erage computational complexity. 

1 Introduction 

The channel capacity of multiplc-input-multiple- 
output (MIMO) channels linearly increases with the 
number of antennas [UH]. Maximum-likelihood (ML) 
detection provides the minimum error rate. However, 
the computational complexity of the simple ML de- 
tection algorithm grows exponentially with the num- 
ber of transmit antennas. Thus, we need an effi- 
cient algorithm that achieves similar error rate to the 
ML detection. The QR decomposition M-algorithm 
(QRD-MLD) [51 H] and sphere decoding (SD) [3] are 
possibly the most promising algorithms. In [10j . to 
reduce the computational complexity, Dijkstra's al- 
gorithm is applied to SD which achieves same error 
rate as ML detection. Both the QRD-MLD and Di- 



jkstra's algorithm are tree search based algorithms. 
Dijkstra's algorithm uses the list of unlimited size to 
keep detection candidates. However, the computa- 
tional complexities of the QRD-MLD and Dijkstra's 
algorithm are still high. To reduce the computa- 
tional complexity, we propose Dijkstra's algorithm 
with bounded list size. When proposed algorithm's 
list size is set to achieve the almost same symbol er- 
ror rate (SER) as the QRD-MLD, the computational 
complexity of proposed algorithm is lower than the 
QRD-MLD. 

This paper is organized as follows. In Section 2, 
we introduce the system model of MIMO channels. 
In Section 3, we review the QRD-MLD and its im- 
provement, then propose Dijkstra's algorithm with 
bounded list size. In Section 4, we show the com- 
parison between the computational complexity of the 
QRD-MLDs and proposed algorithm by computer 
simulations. Finally, we give the conclusion in sec- 
tion 5. 



2 System model 

We consider the uncoded system with t transmit an- 
tennas and r receive antennas, and we assume r > t. 
We assume that the noise at each receive antenna is 
the additive white Gaussian noise (AWGN). Let x 
be a t x 1 vector consisting of complex envelopes of 
transmitted signals with the signal constellation S, H 
anrxi fading matrix whose (k, j) entry is a complex 
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fading coefficient between j-th transmit antenna and 
k-th receive antenna, zanrxl complex vector whose 
component is noise at each receive antenna, and y an 
r x f complex vector whose component is the received 
signal component at each receive antenna. The model 
of this channel is written as 

Y = Hx + z. (1) 

We assume that the receiver knows the channel state 
information H perfectly. 

In this case, the ML detection of the transmitted 
signal over the channel (1) can be formulated as find- 
ing 

x m ; = argmin||y - Hx|| 2 . (2) 

xes f 

3 Near ML detection algorithm 

In this section, we propose the new near ML detec- 
tion algorithm. First, to calculate (2) efficiently, we 
explain how to find the ML signal by tree search al- 
gorithm in Section 3.1. Then, we review near ML 
detection algorithms called QRD-MLD [H [6] and its 
improvement [5] in Section 3.2. Finally, we propose 
Dijkstra's algorithm with bounded list size in Section 
3.3. 

3.1 QR decomposition 

To calculate (2) efficiently, we compute a QR decom- 
position of H and obtain an upper triangular matrix 
R and a unitary matrix Q with H = QR. Since Q is 
unitary, 

||y - Hx|| 2 = ||q*y - q*Hx|| 2 = ||Q*y - Rx|| 2 . (3) 

Let £ = Q*y = • • • , £ r ) T . The ML detection prob- 
lem (2) can be reformulated as finding 

x m t = axgmin||£ — Rx|| 2 
xes* 

= arg min J ^ \^ - ^ R jt iXi | 2 + ^ |& | 2 1 

XGS \j=l i=j k=t J 

™^ I 51 & " R j,i x i\ 2 \ • ( 4 ) 



The second equality above follows as the second term 
in the second equation is irrelevant to x. 

To calculate (4) efficiently, we consider a weighted 
directed graph as follows. The decisions on n con- 
struct a tree where nodes at fc-th depth are corre- 
spond to the candidate of Xt-h+i @], and the root 
node is placed at depth 0. Then, the metric value, 
which is the weight of branch, between a node X\ that 
has Xt,- • ■ ,Xj+i(xfc G S, i + 1 < k < t) as ancestor 
nodes from the root node to its parent node is defined 

by 

t 

j=i+l 

The distance of each node from the root node, which 
is called the accumulated metric value in this paper, 
is equal to the sum of the metric values of branches 
from the root node to the node itself. The accumu- 
lated metric value from the root node to the bottom 
node whose depth is t is 

t t t 

^772, =5^|£j -^2 K J,iXi\ 2 - ( 5 ) 

i=l j=l i=j 

Because x that makes (5) minimum is equal to x m ; 
of (4), the shortest path from the root node to the 
bottom node corresponds to the ML signal 0]. 

3.2 QRD-MLD 

The QRD-MLD [3[6], which is a breadth-first tree 
search based algorithm, finds a near ML signal. The 
QRD-MLD keeps only M nodes at each depth with 
the smallest accumulated metric values [7] , instead of 
testing all the candidate in S* according to (4). At 
each depth, only M nodes make their child nodes. 
We call a node that makes its child node detection 
node in this paper. 

An improvement to QRD-MLD proposed in [S] re- 
duces the number of detection nodes from the original 
QRD-MLD. This improved QRD-MLD has threshold 
value at each depth. The depth i 's threshold value 
A, is defined by 

A i = E i<min + X(f> 2 , (6) 
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where Ei >m i n is the smallest accumulated metric 
value of the node at i-ih depth in the nodes whose 
parent node is a detection node. X is a fixed constant 
number, and (j) 2 is the noise variance. At each depth, 
select the nodes that have smaller accumulated met- 
ric value than threshold value Aj. If the number of 
selected nodes is more than M, only M nodes with 
smallest accumulated metric values are selected. 

Note that both algorithms do not always find the 
ML signal. For small to medium M values, the com- 
plexity is substantially lower than the simple ML de- 
tection algorithm. However, the final result is no 
longer guaranteed to be the ML signal. 

3.3 Proposed algorithm: Dijkstra's 
algorithm with bounded list size 

Dijkstra's algorithm is an efficient algorithm to find 
the shortest path from a point to a destination in a 
weighted graph .9 . Dijkstra's algorithm uses the list 
of unlimited size to keep candidate nodes. If we use 
Dijkstra's algorithm to find the shortest path from 
the root to one of nodes at the bottom depth, wc 
can get the node with minimum ||y — Hx| | 2 among all 
nodes at the bottom depth and it corresponds to the 
ML estimate [10]. However, this algorithm still has 
high computational complexity. To reduce the com- 
putational complxity, we propose a modified version 
of Dijkstra's algorithm whose list keeps only L nodes 
with the smallest accumulated metric values in the 
list. 

We show Dijkstra's algorithm with bounded list 
size. 

1. Create an empty list for nodes. 

2. Insert all nodes at the first level into the list. 

3. Select the node A having smallest accumulated 
metric value in the list and remove it from the 
list. If the depth of A is t, then output the node 
A and its ancestor nodes as the ML signal and 
finish this algorithm. 

4. Insert all A's child nodes into the list. 

5. Arrange the nodes in the list according to the ac- 
cumulated metric value by the quick sort. If the 
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Figure 1: (4x4) symbol error rate 

list has more than L nodes, select the L nodes 
with the smallest accumulated metric values in 
the list, and discard other nodes from the list. 

6. Go back to Step 3. 

The node whose child nodes are inserted into the 
list is called detection node in this paper. Because the 
discarded nodes, which are decided at Step 5, and 
their descendant nodes are not examined, the pro- 
posed algorithm dose not examine all the candidate 
in S* according to (4). Thus, the proposed algorithm 
dose not always find the ML signal. 

When we use LDPC codes [12] or turbo codes [13] 
after detection, we have to compute N most likely 
signals [11. Such signals can be computed by this 
algorithm's modification that is finished after out- 
put TV signals with the smallest accumulated metric 
value. 

4 Computer simulation 

In this section, we compare the computational com- 
plexity, the number of detection nodes and the num- 
ber of comparisons of real numbers among the pro- 
posed algorithm and the QRD-MLDs. Throughout 
the simulations, we consider the following system 
model. 
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Figure 2: (4 x 4) average computational complexity Figure 4: (4 x 4) average number of detection nodes 
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Figure 3: (4x4) maximum computational complexity Figure 5: (4x4) maximum number of detection nodes 



• We do two simulations. In the first simulation, 
the number of transmit antennas t = 4, and the 
number of receive antennas r = 4. In the second 
simulation, the number of transmit antennas t = 
6, and the number of receive antennas r = 6. 

• The signal constellation at each transmit an- 
tenna is 16- QAM and all signals are drawn ac- 
cording to the uniform i.i.d. distribution. 

• The fading coefficients obey the CN(0, 1) distri- 
bution, and the receiver knows it perfectly. 

• The noise at each recieve antenna obeys the 



CN(0, <j)) distribution. <fi is caluculated by </> 2 = 
tE s x lo(~ SNR / 10 \ where E s is the average sym- 
bol energy. 

• We transmit 100000 signals, which is 400000 
symbols if the number of transmit antennas is 
4 and 600000 symbols if the number of transmit 
antennas is 6, and every 100 signals, change the 
fading matrix. 

If M = 16 is used and the signal constellation is 
16-QAM, QRD-MLD has symbol error rate (SER) 
near to the ML detection [7]. So, we use M = 16. 
In QRD-MLD's improvement, we use X = 2 in (6) 
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Figure 6: (4x4) average number of comparisons of 
real numbers 
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Figure 7: (4 x 4) maximum number of comparisons 
of real numbers 



as used in [8|. In order for the proposed algorithm 
to have the similar SER to QRD-MLD and its im- 
provement, we use two versions of proposed algorithm 
whose list sizes are L — 16 and L — 5. Figures 1 and 
8 show that the proposed algorithm with L = 16, 
the original QRD-MLD and the ML algorithm have 
almost the same SER throughout this simulations. 
The proposed algorithm with L = 5 and QRD-MLD's 
improvement also have similar SER throughout this 
simulations. 



Figure 8: (6 x 6) symbol error rate 



We count the number of multiplications and divi- 
sions of complex numbers as the computational com- 
plexity. Since the part of QR decomposition is the 
common part of all compared algorithms, we do not 
include that part in comparison of complexity. 

In QRD-MLDs, we use the quick sort to arrange 
the nodes and decide M nodes with the smallest ac- 
cumulated metric value at each depth. 

Because the QRD-MLD keeps M nodes at each 
depth, the number of detection nodes and the com- 
putational complexity are completely determined by 
M. However, in the proposed algorithm and QRD- 
MLD's improvement, the number of detection nodes 
and the computational complexity are not fixed. 

Figures 1-7 are the results of first simulation whose 
number of transmit antennas and receive antennas 
are 4. Figures 8-14 are the results of second simula- 
tion whose number of transmit antennas and receive 
antennas are 6. 

At first, we discuss the result of first simulation. 
According to Figures 2, 4 and 6, the propose algo- 
rithm with L = 16 reduece the average computational 
complexity, average number of detection nodes and 
average number of comparisons of real numbers from 
the original QRD-MLD. Moreover, in the case of high 
SNR, although the proposed algorithm with L = 16 
has much smaller SER than QRD-MLD's improve- 
ment according to Figure 1, the average computa- 
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Figure 9: (6 x 6) average computational complexity 
>> 1 2000 



Figure 11: (6x6) average number of detection nodes 
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Figure 10: (6 x 6) maximum computational complex- Figure 12: 
ity nodes 
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tional complexity of proposed algorithm with L = 16 
is almost the same as QRD-MLD's improvement. In 
the case of low SNR, the average computational com- 
plexity, average number of detection nodes and aver- 
age number of comparisons of real numbers of the 
proposed algorithm with L — 5 are lower than QRD- 
MLD's improvement. In the case of high SNR, the av- 
erage computational complexity, average number of 
detection nodes and average number of comparisons 
of real numbers of proposed algorithm with L = 5 
are almost same as QRD-MLD's improvement while 
the proposed algorithm has smaller SER according 



to Figure 1. According to Figures 3, 5 and 7, in 
the case of low SNR, maximum computational com- 
plexity, maximum number of detection nodes and the 
maximum number of comparisons of real numbers of 
the proposed algorithm with L = 16 are higher than 
QRD-MLDs. However, because the average compu- 
tational complexity, the average number of detection 
nodes and the average number of comparisons of real 
number of the proposed algorithm with L = 16 are 
lower than QRD-MLDs, we find that the proposed 
algorithm rarely gets high computational complex- 
ity, large number of detection nodes or large number 
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ror rate (SER) as the original QRD-MLD, the aver- 
age of the computational complexity and the number 
of detection nodes are reduced. When the list size 
is adjusted so that the proposed algorithm has the 
almost same symbol error rate (SER) as the QRD- 
MLD 's improvement, in the case of low SNR, both 
the average computational complexity and average 
number of detection nodes are reduced and in the 
case of high SNR, the computational complexity and 
average number of detection nodes of proposed algo- 
rithm is almost same as QRD-MLD 's improvement 
while SER of the proposed algorithm becomes smaller 
than QRD-MLD 's improvement. 



Figure 13: (6 x 6) average number of comparisons of 
real numbers 




5 10 15 20 25 30 
SNR at single receive antenna [dB] 



Figure 14: (6 x 6) maximum number of comparisons 
of real numbers 

of comparisons of real numbers. 

According to Figure 8-14, which is the result of 
second simulation, the characteristic of proposed al- 
gorithm dose not change with the number of anten- 
nas. 

5 Conclusion 

In this paper, we propose a near ML detection al- 
gorithm. When the list size is adjusted so that the 
proposed algorithm has the almost same symbol cr- 
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